Highly Efficient Asynchronous Execution of Large-Grained Parallel Programs
نویسندگان
چکیده
An n-thread parallel program P is large-grained if in every parallel step the computations on each of the threads are complex procedures requiring numerous processor instructions. This practically relevant style of programs diiers from PRAM programs in its large granularity and the possibility that within a parallel step the computations on diierent threads may considerably vary in size. Let M be an n-processor asyn-chronous parallel system, with no restriction on the degree of asynchrony and without any specialized synchronization mechanisms. It is a challenging theoretical as well as practically important problem to ensure correct execution of P on such a parallel machine. Let P be a large-grained program requiring total work W for its execution on a synchronous n-processor parallel system. We present a transformation (compilation) of P into a program C(P) which correctly and eeciently eeects the computation of P on the asynchronous machine M. Under moderate assumptions on the granu-larity of threads and the size of the program variables, execution of C(P) requires just O(W log n) expected total work, and the memory space overhead is a small multiplicative constant. This result is the rst of its kind. The solution involves a number of new concepts and methods. These include methods for storing program and control variables, employing a combination of error correction codes with phase-dependent hashing into memory. We feel that these methods for storing data will have signiicant practical applications to storage of data on Disk Arrays (RAIDS), as well as additional theoretical implications. The signiicance of the present work to parallel data-processing programs and large scale parallel numerical computations is obvious.
منابع مشابه
Scheduling DAG's for Asynchronous Multiprocessor Execution
A new approach is given for scheduling a sequential instruction stream for execution “in parallel” on asynchronous multiprocessors. The key idea in our approach is to exploit the fine grained parallelism present in the instruction stream. In this context, schedules are constructed by a careful balancing of execution and communication costs at the level of individual instructions, and their data...
متن کاملAsynchronous Parallel Simulation of Parallel Programs
Parallel simulation of parallel programs for large datasets has been shown to ooer signiicant reduction in the execution time of many discrete event models. This paper describes the design and implementation of MPI-SIM, a library for the execution driven parallel simulation of task and data parallel programs. MPI-SIM can be used to predict the performance of existing programs written using MPI ...
متن کاملFine-grained Parallel Computation in the PM/OM Object Model
In this paper a novel computation model based on fine-grained objects and asynchronous procedure invocation is presented. Based on a set of only a few basic definitions it can be used to write well structured programs as well as exploit a large degree of potential parallelism. Yet the model is strong enough to express complicated subjects such as active objects and coordination patterns. By inc...
متن کاملProfiling Techniques for a Fine-Grained Parallel Language
Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine-grained parallel programming language Split-C, which provides a simple global address space m...
متن کاملParallelization Strategies for Variable Neighborhood Search
We analyze five parallelization strategies for the Variable Neighborhood Search (VNS) meta-heuristic. They are based on asynchronous cooperation of several search threads. We tested parallelization on various levels, from a low-level parallel neighborhood exploration, through medium-grained asynchronous execution of basic VNS steps (shaking and local search), to the coarse-grained asynchronous ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993